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1 .4 Analysis of Algorithms 

► introduction 

► observations 

► mathematical models 

► order-of-growth classifications 

► theory of algorithms 

► memory 



Cast of characters 



Programmer needs to develop 
a working solution. 




Client wants to solve 
problem efficiently. 



Student might play 
any or all of these 
roles someday. 




Theoretician wants 
to understand. 


B Basic blocking and tackling 
9 is sometimes necessary, 
[this lecture] 



Running time 


As soon as an Analytic Engine exists, it will necessarily guide the future 
course of the science. Whenever any result is sought by its aid, the question 
will arise — By what course of calculation can these results be arrived at by 
the machine in the shortest time? ” — Charles Babbage (1864) 




how many times do you 
have to turn the crank? 


Analytic Engine 
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Reasons to analyze algorithms 


Predict performance. 
Compare algorithms. 
Provide guarantees. 




Understand theoretical basis. 


this course 


theory of algorithms 


Primary practical reason: avoid performance bugs. 



client gets poor performance because programmer 
did not understand performance characteristics 
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Some algorithmic successes 


Discrete Fourier transform. 

• Break down waveform of N samples into periodic components. 

• Applications: DVD, JPEG, MRI, astrophysics, .... 

• Brute force: N 2 steps. 

• FFT algorithm: NlogN steps, enables new technology. 



Friedrich Gauss 
1805 


time 
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Some algorithmic successes 


N-body simulation. 

• Simulate gravitational interactions among N bodies. 

• Brute force: N 2 steps. 

• Barnes-Hut algorithm: NlogN steps, enables new research. 



Andrew Appel 
PU '81 


time 



m 
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The challenge 


Q. Will my program be able to solve a large practical input? 



Why does it run out of memory ? 


Insight. [Knuth 1970s] Use scientific method to understand performance. 



Scientific method applied to analysis of algorithms 


A framework for predicting performance and comparing algorithms. 

Scientific method. 

• Observe some feature of the natural world. 

• Hypothesize a model that is consistent with the observations. 

• Predict events using the hypothesis. 

• Verify the predictions by making further observations. 

• Validate by repeating until the hypothesis and observations agree. 


Principles. 

• Experiments must be reproducible. 

• Hypotheses must be falsifiable. 

Feature of the natural world. Computer itself. 
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1 .4 Analysis of Algorithms 


^introduction > 

► observations 

► mathematical models 



c icatk?ns 


Example: 3-SUM 


3-Sum. Given N distinct integers, how many triples sum to exactly zero? 


% more 8ints.txt 

8 

30 -40 -20 -10 40 0 10 5 

% java ThreeSum 8ints.txt 

4 


a[i ] 

a[j] 

a[k] 

sum 

30 

-40 

10 

0 

30 

-20 

-10 

0 

-40 

40 

0 

0 

-10 

0 

10 

0 


Context. Deeply related to problems in computational geometry. 


3-SUM: brute-force algorithm 


public class ThreeSum 

{ 

public static int count(int[] a) 

{ 

int N = a. length; 
int count = 0; 

check each triple 

for simplicity, ignore 
integer overflow 


for (int i =0; i < N; i++) 

for (int j = i+1; j < N; j++) 
for (int k = j+1; k < N; k++) 
if (a[i ] + a[j] + a[k] == 0) 
count++ ; 
return count; 


+- 

+■ 


} 


public static void mai n(String[] args) 

{ 

int[] a = In . readlnts(args [0] ) ; 
StdOut . pri ntl n(count(a)) ; 

} 



Measuring the running time 


Q. How to time a program? 
A. Manual. 



% java ThreeSum lKints.txt 



% java ThreeSum 2Kints.txt 



tick tick tick tick tick tick tick tick 
tick tick tick tick tick tick tick tick 
tick tick tick tick tick tick tick tick 


528 


% java ThreeSum 4Kints.txt 


tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 

tick 


4039 
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Measuring the running time 


Q. How to time a program? 

A. Automatic. 

public class Stopwatch (part of stdiib.jar) 

Stopwatch 0 create a new stopwatch 

double elapsedTimeQ time since creation (in seconds) 


public static void mai n(Stri ng [] args) 

{ 

i nt [] a = In . readlnts(args [0] ) ; 
Stopwatch stopwatch = new Stopwatch (); 
StdOut . pri ntl n(ThreeSum. count (a)) ; 
double time = stopwatch . el apsedTimeO ; 

} 
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Empirical analysis 


Run the program for various input sizes and measure running time. 


Empirical analysis 


Run the program for various input sizes and measure running time. 


N 

time (seconds) t 

250 

0.0 

500 

0.0 

1,000 

0.1 

2,000 

0.8 

4,000 

6.4 

8,000 

51.1 

16,000 

? 
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Data analysis 


Standard plot. Plot running time T(N) vs. input size N. 



Data analysis 


Log-log plot. Plot running time T(N) vs. input size N using log-log scale. 



IK 2K 4K 8K 

lg N 


\g(T(N)) = b lg N + c 
b = 2.999 
c = -33.2103 

T (N) = a N b , where a = 2 C 


power law 


Regression. Fit straight line through data points: aN b . ^ sl °P e 
Hypothesis. The running time is about 1.006 x 10 10 x n 2 - 999 seconds. 


Prediction and validation 


Hypothesis. The running time is about 1.006 x 10 10 x n 2 ^ seconds. 


"order of growth" of running 
time is about N 3 [stay tuned] 

Predictions. 

• 51.0 seconds for N = 8,000. 

• 408.1 seconds for N= 16,000. 


Observations. 


N 

time (seconds) + 

8,000 

51.1 

8,000 

51.0 

8,000 

51.1 

16,000 

410.8 


validates hypothesis! 


Doubling hypothesis 


Doubling hypothesis. Quick way to estimate b in a power-law relationship. 
Run program, doubling the size of the input. 


N 

time (seconds) t 

ratio 

Ig ratio 

250 

0.0 


— 

500 

0.0 

4.8 

2.3 

1,000 

0.1 

6.9 

2.8 

2,000 

0.8 

7.7 

2.9 

4,000 

6.4 

8.0 

3.0 

8,000 

51.1 

8.0 

3.0 

i 1 


t 


seems to converge to a constant b ~ 3 

Hypothesis. Running time is about aN b with b = Ig ratio. 

Caveat. Cannot identify logarithmic factors with doubling hypothesis. 


Doubling hypothesis 


Doubling hypothesis. Quick way to estimate b in a power-law relationship. 
Q. How to estimate a (assuming we know b ) ? 

A. Run the program (for a sufficient large value of N) and solve for a. 


N 

time (seconds) t 

8,000 

51.1 

8,000 

51.0 

8,000 

51.1 


51.1 = a x 8000 3 
^ a = 0.998 x 10 - 10 


Hypothesis. Running time is about 0.998 x 10 ~ 10 x N 3 seconds. 

t 

almost identical hypothesis 
to one obtained via linear regression 
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Experimental algorithmics 


System independent effects. 

• Algorithm. 

• Input data. 


determines exponent b 
in oower law 


'N 


1 determines constant 

System dependent effects. > in power law 

• Hardware: CPU, memory, cache, ... 

• Software: compiler, interpreter, garbage collector, ... 

• System: operating system, network, other apps, ... J 


Bad news. Difficult to get precise measurements. 

Good news. Much easier and cheaper than other sciences. 



e.g., can run huge number of experiments 
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1 .4 Analysis of Algorithms 


^introduction > 

► observations 

► mathematical models 



c icatk?ns 
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1 .4 Analysis of Algorithms 



Mathematical models for running time 


Total running time: sum of cost x frequency for all operations. 

• Need to analyze program to determine set of operations. 

• Cost depends on machine, compiler. 

• Frequency depends on algorithm, input data. 









nil; CLASSIC W(HIK 

M vm 1 PI1ATEDASI) REVlStD 

The Classic work 

NEWLY UPDATED AND REVISED 


The Classic work 

NEWLY UPDATED AND REVISED 


THE CLASSIC WORK 

EXTENDED AND REPINED 

The Art of 
Computer 
Programming 

WU 'ME 1 

luiuljimnu! .Algorithms 

third Edition 

The Art of 
Computer 
Programming 

VOLUME 2 

Seminumerical Algorithms 

Third Edition 


The Art of 
Computer 
Programming 

VOLUME 3 

Sorting and Searching 

Second Edition 


The Art of 
Computer 
Programming 

VOLUME 4a 

Combinatorial Algorithms 

Part 1 

DONAIJ) E. KNUTH 

DONALD E. KNUTH 


DONALD E. KNUTH 


DONALD E. KNUTH 









Donald Knuth 
1 974 Turing Award 


In principle, accurate mathematical models are available. 
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Cost of basic operations 


operation 

example 

nanoseconds + 

integer add 

a + b 

2.1 

integer multiply 

a * b 

2.4 

integer divide 

a / b 

5.4 

floating-point add 

a + b 

4.6 

floating-point multiply 

a * b 

4.2 

floating-point divide 

a / b 

13.5 

sine 

Math . si n (theta) 

91.3 

arctangent 

Math.atan2(y, x) 

129.0 


f Running OS X on Macbook Pro 2.2GHz with 2GB RAM 
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Cost of basic operations 


operation 

example 

nanoseconds + 

variable declaration 

int a 

Cl 

assignment statement 

a = b 

C2 

integer compare 

a < b 

C3 

array element access 

a[i] 

C4 

array length 

a. length 

C5 

1 D array allocation 

new int[N] 

c 6 N 

2D array allocation 

new i nt [N] [N] 

c 7 N 2 

string length 

s . lengthO 

C8 

substring extraction 

s . substri ng(N/2 , N) 

C9 

string concatenation 

s + t 

cio N 


Novice mistake. Abusive string concatenation. 
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Example: 1-SUM 


Q. How many instructions as a function of input size N? 

int count = 0; 
for (int i =0; i < N; i++) 
if (a[i] == 0) 
count++; 


operation 

frequency 

variable declaration 

2 

assignment statement 

2 

less than compare 

N + 1 

equal to compare 

N 

array access 

N 

increment 

N to 2 N 
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Example: 2-SUM 


Q. How many instructions as a function of input size N? 



operation 

frequency 

variable declaration 

N + 2 

assignment statement 

N + 2 

less than compare 

!4 (N + 1 ) (N + 2) 

equal to compare 

J4 N (N - 1 ) 

array access 

N (N - 1) 

increment 

!4 N (N - 1 ) to N (N - 1 ) 


+ 1) = (N -1) 



tedious to count exactly 
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Simplifying the calculations 


“ It is convenient to have a measure of the amount of work involved 
in a computing process, even though it be a very crude one. We may 
count up the number of times that various elementary operations are 
applied in the whole process and then given them various weights. 
We might, for instance, count the number of additions, subtractions, 
multiplications, divisions, recording of numbers, and extractions 
of figures from tables. In the case of computing with matrices most 
of the work consists of multiplications and writing down numbers, 
and we shall therefore only attempt to count the number of 
multiplications and recordings. ” — Alan Turing 


ROUNDING-OFF ERRORS IN MATRIX PROCESSES 

By A. M. TURING 

(National Physical Laboratory , Tedding ton, Middlesex) 

[Received 4 November 1947] 

SUMMARY 

A number of methods of solving sets of linear equations and inverting matrices 
are discussed. The theory of the rounding-off errors involved is investigated for 
some of the methods. In all cases examined, including the well-known ‘Gauss 
elimination process’, it is found that the errors are normally quite moderate: no 
exponential build-up need occur. 
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Simplification 1 : cost model 


Cost model. Use some basic operation as a proxy for running time. 



0 + 1 + 2 + .. . + (AT- 1) 


operation 

frequency 

variable declaration 

N + 2 

assignment statement 

N + 2 

less than compare 

!4 (N + 1 ) (N + 2) 

equal to compare 

J4 N (N - 1 ) 

array access 

N (N - 1) -* — 

increment 

'A N (N - 1 ) to N (N - 1 ) 


1 

2 


N(N-l) 

N 
2 


cost model = array accesses 

(we assume compiler/JVM do not 
optimize array accesses away!) 
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Simplification 2: tilde notation 

• Estimate running time (or memory) as a function of input size N. 

• Ignore lower order terms. 

- when N is large, terms are negligible 

- when N is small, we don't care 


Ex 1 . VeN 3 + 20 N + 16 
Ex 2. VeN 3 + 100 IV 4/3 + 56 

Ex 3. VsN 3 - i/ 2 N 2 + Vi N 


j 


Vs N 3 
Ye N 3 
VtsN 3 



1 66,666,667 \ // N 3 /6 - N 2 /2 + N/3 


166,167,000 


Y 

discard lower-order terms 
(e.g., N = 1000: 500 thousand vs. 166 million) 


1,000 

Leading-term approximation 


Technical definition. KN)~g(N) means lim ^ = 1 

00 g(N) 
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Simplification 2: tilde notation 


• Estimate running time (or memory) as a function of input size N. 

• Ignore lower order terms. 

- when N is large, terms are negligible 

- when N is small, we don't care 


operation 

frequency 

tilde notation 

variable declaration 

N + 2 

~ N 

assignment statement 

N + 2 

~ N 

less than compare 

MN + 1 ) (N + 2) 

~ K 2 N 2 

equal to compare 

/ 2 N (N - 1 ) 

~ / 2 N 2 

array access 

N (N - 1 ) 

~ N 2 

increment 

Yi N (N - 1 ) to N (N - 1 ) 

~ >4 N 2 to ~ N 2 
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Example: 2-SUM 


Q. Approximately how many array accesses as a function of input size N? 



Bottom line. Use cost model and tilde notation to simplify counts. 
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Example: 3-SUM 


Q. Approximately how many array accesses as a function of input size N? 


A. 



Bottom line. Use cost model and tilde notation to simplify counts. 
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Estimating a discrete sum 


Q. How to estimate a discrete sum? 

A1 . Take discrete mathematics course. 

A2. Replace the sum with an integral, and use calculus! 


Ex 1 . 1 + 2 + . . . + N. 


Ex 2. l* + 2 k +...+N k . 


N 

E i ~ 
1=1 



sj 


1 

2 


N 


2 


N 


E* 

2=1 


k 


>N 


x—1 


x k dx 


k + 1 


jV fc+1 


Ex B. 1 + 1/2 + 1/3 + ... + 1 IN. 



2=1 





IniV 


N N N 

Ex 4. 3-sum triple loop. y^y^y^ 1 

2=1 j —i k=j 



>N rN 


y=x J z=y 


dz dy dx 



rsj 




Mathematical models for running time 


In principle, accurate mathematical models are available. 


In practice, 

• Formulas can be complicated. 

• Advanced mathematics might be required. 

• Exact models best left for experts. 



costs (depend on machine, compiler) 

^ y \ 

Tn — ci A + ci B + C3 C + C 4 D + C5 E 

A = array access 
B = integer add 
C= integer compare 
D = increment 
E = variable assignment 



frequencies 

(depend on algorithm, input) 


Bottom line. We use approximate models in this course: T(N) ~ cN 3 . 
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rvetficps — ^r, 

fjjrfdtbematica^ models 

► order-of-growth classifications 


Common order-of-growth classifications 

order of growth discards 

Good news, the small set of functions leading coefficient 

1, log N, N, N log N, N 2 , N 3 , and 2 N 

suffices to describe order-of-growth of typical algorithms. 


log-log plot 



IK 


2K 4K 8K ^ 

Typical orders of growth 


512K 
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Common order-of-growth classifications 


order of 
growth 

name 

typical code framework 

description 

example 

1 

T(2N) / T(N) 

1 

constant 

a = b + c; 

statement 

add two 

numbers 

1 

log N 

logarithmic 

while (N > 1) 

{ N = N / 2; ... } 

divide in half 

binary search 

~ 1 

N 

linear 

for (int i = 0; i < N; i++) 

{ ... } 

loop 

find the 

maximum 

2 

N log N 

linearithmic 

[see mergesort lecture] 

divide 

and conquer 

mergesort 

~ 2 

N 2 

quadratic 

for (int i = 0; i < N; i++) 
for (int j =0; j < N; j++) 

{ ... } 

double loop 

check all 
pairs 

4 

N3 

cubic 

for (int i = 0; i < N; i++) 
for (int j =0; j < N; j++) 
for (int k = 0; k < N; k++) 

{ ... } 

triple loop 

check all 
triples 

8 

2 N 

exponential 

[see combinatorial search lecture] 

exhaustive 

search 

check all 

subsets 

T(N) 


Practical implications of order-of-growth 


growth 


problem size solvable in minutes 

■ 

rate 

1970s 

1980s 

1990s 

2000s 

1 

any 

any 

any 

any 

log N 

any 

any 

any 

any 

N 

millions 

tens of 
millions 

hundreds of 
millions 

billions 

N log N 

hundreds of 
thousands 

millions 

millions 

hundreds of 
millions 

tens of 

thousands 

N 2 

hundreds 

thousand 

thousands 


hundred 

hundreds 

thousand 

thousands 

2 N 

20 

20s 

20s 

30 


Bottom line. Need linear or linearithmic alg to keep pace with Moore's law. 
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Binary search demo 


Goal. Given a sorted array and a key, find index of the key in the array? 

Binary search. Compare 

• Too small, go left. 

• Too big, go right. 

• Equal, found. 


key against middle entry. 
© 


successful search for SB 


6 

o 

t 

lo 


13 14 25 33 43 51 53 64 72 84 93 95 96 97 


10 


i l 


12 


13 


T 

hi 
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Binary search: Java implementation 


Trivial to implement? 

• First binary search published in 1 946; first bug-free one in 1 962. 

• Bug in Java's Arrays. binarySearchQ discovered in 2006. 


public static int bi narySearch(i nt [] a, int key) 

{ 

int lo = 0, hi = a. length-1; 
while (lo <= hi) 

{ 

int mid = lo + (hi - lo) / 2; 
if (key < a [mid]) hi = mid - 1; 

else if (key > a [mid]) lo = mid + 1; 
else return mid; 

} 

return -1; 


one "3-way compare" 


Invariant. If key appears in the array a[], then a[lo] < key < a [hi]. 
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Binary search: mathematical analysis 


Proposition. Binary search uses at most 1 +lg7V key compares to search in 
a sorted array of size N. 

Def. T(N) = # key compares to binary search a sorted subarray of size < N. 
Binary search recurrence. T(N) < T(N/2) + 1 for N > 1, with 7(1) = 1 . 

t t 


left or right half 

possible to implement with one 

Pf sketch. 

2-way compare (instead of B-way) 

T(N) < T(N/2) + 1 

given 

< T(N/4) +1 + 1 

apply recurrence to first term 

< T(N/8) +1 + 1 + 1 

apply recurrence to first term 


< T(N / N) + 1 + 1 + ... + 1 stop applying, T(l) = 1 

= 1 + lg N 
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An N 2 log N algorithm for 3-SUM 


Sorting-based algorithm. 

• Stepl: Sort the N (distinct) numbers. 

• Step 2: For each pair of numbers a[i] 
and a[j] , binary search for -Ca[i] + a[j]). 


Analysis. Order of growth is TV 2 log N. 

• Step 1: TV 2 with insertion sort. 

• Step 2: TV 2 log TV with binary search. 


0 10 5 


input 

30 -40 -20 -10 40 


sort 

-40 -20 -10 0 5 10 30 40 


binary search 


(-40, 

-20) 


(-40, 

-10) 

(40) 

(-40, 

0) 

(-40, 

5) 


(-40, 

10) 


1 

• 4^ * 

O 

40) 

JB* 

(-20, 

-10) 

(JO) 

only count if 



a[i] < a[j] < a[k] 

(-10, 

0) 

CjLO m — y to avoid 

( 10, 

30) 

/ double counting 

-AQ * 

( 10, 

40) 


( 30, 

40) 
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Comparing programs 


Hypothesis. The sorting-based N 2 logN algorithm for 3 -Sum is significantly 
faster in practice than the brute-force TV 3 algorithm. 


N 

time (seconds) 

1,000 

0.1 

2,000 

0.8 

4,000 

6.4 

8,000 

51.1 


ThreeSum.java 


N 

time (seconds) 

1,000 

0.14 

2,000 

0.18 

4,000 

0.34 

8,000 

0.96 

16,000 

3.67 

32,000 

14.88 

64,000 

59.16 


ThreeSumDeluxe.java 


Guiding principle. Typically, better order of growth => faster in practice. 
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irvqtion 

rjfidthematicql models 
► oraer-Qt-growth classifications 


► theory of algorithms 




Types of analyses 


Best case. Lower bound on cost. 

• Determined by “easiest” input. 

• Provides a goal for all inputs. 

Worst case. Upper bound on cost. 

• Determined by “most difficult” input. 

• Provides a guarantee for all inputs. 

Average case. Expected cost for random input. 

• Need a model for “random” input. 

• Provides a way to predict performance. 


Ex 1 . Array accesses for brute-force 3 -Sum. 


Ex 2. Compares for binary search. 

Best: ~ V 2 N 3 


Best: ~ 1 

Average: ~‘/ 2 iV 3 


Average: ~ 1 gN 

Worst: ~ViN 3 


Worst: ~ lgTV 
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Types of analyses 


Best case. Lower bound on cost. 
Worst case. Upper bound on cost. 
Average case. “Expected” cost. 


Actual data might not match input model? 

• Need to understand input to effectively process it. 

• Approach 1 : design for the worst case. 

• Approach 2: randomize, depend on probabilistic guarantee. 
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Theory of algorithms 


Goals. 

• Establish “difficulty” of a problem. 

• Develop “optimal” algorithms. 

Approach. 

• Suppress details in analysis: analyze “to within a constant factor”. 

• Eliminate variability in input model by focusing on the worst case. 

Optimal algorithm. 

• Performance guarantee (to within a constant factor) for any input. 

• No algorithm can provide a better performance guarantee. 
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Commonly-used notations in the theory of algorithms 


notation 

provides 

example 

shorthand for 

used to 

Big Theta 

asymptotic 
order of growth 

0(N 2 ) 

/ 2 N 2 

10 N 2 

5 N 2 + 22 N log N+ 3N 

classify 

algorithms 

Big Oh 

0(N 2 ) and smaller 

0(N 2 ) 

10 N 2 

100 N 

22 N log N + 3 N 

develop 
upper bounds 

Big Omega 

0(N 2 ) and larger 

0(N 2 ) 

/ 2 N 2 

N5 

N3+ 22 N log N+ 3 N 

develop 
lower bounds 


Theory of algorithms: example 1 


Goals. 

• Establish “difficulty” of a problem and develop “optimal” algorithms. 

• Ex. 1 -Sum = “As there a 0 in the array? ” 

Upper bound. A specific algorithm. 

• Ex. Brute-force algorithm for 1-Sum: Look at every array entry. 

• Running time of the optimal algorithm for 1-Sum is 0(A/). 

Lower bound. Proof that no algorithm can do better. 

• Ex. Have to examine all N entries (any unexamined one might be 0). 

• Running time of the optimal algorithm for 1-Sum is Q(/V). 

Optimal algorithm. 

• Lower bound equals upper bound (to within a constant factor). 

• Ex. Brute-force algorithm for 1-Sum is optimal: its running time is 0(/V). 
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Theory of algorithms: example 2 


Goals. 

• Establish “difficulty” of a problem and develop “optimal” algorithms. 

• Ex. 3-Sum. 


Upper bound. A specific algorithm. 

• Ex. Brute-force algorithm for 3-Sum. 

• Running time of the optimal algorithm for 3-Sum is 0(A/ 3 ). 
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Theory of algorithms: example 2 


Goals. 

• Establish “difficulty” of a problem and develop “optimal” algorithms. 

• Ex. 3 -Sum. 


Upper bound. A specific algorithm. 

• Ex. Improved algorithm for 3 -Sum. 

• Running time of the optimal algorithm for 3 -Sum is 0(A/ 2 log A/). 

Lower bound. Proof that no algorithm can do better. 

• Ex. Have to examine all N entries to solve 3 -Sum. 

• Running time of the optimal algorithm for solving 3 -Sum is Q(A/). 

Open problems. 

• Optimal algorithm for 3 -Sum? 

• Subquadratic algorithm for 3 -Sum? 

• Quadratic lower bound for 3 -Sum? 
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Algorithm design approach 


Start. 

• Develop an algorithm. 

• Prove a lower bound. 

Gap? 

• Lower the upper bound (discover a new algorithm). 

• Raise the lower bound (more difficult). 

Golden Age of Algorithm Design. 

• 1970s-. 

• Steadily decreasing upper bounds for many important problems. 

• Many known optimal algorithms. 

Caveats. 

• Overly pessimistic to focus on worst case? 

• Need better than “to within a constant factor” to predict performance. 
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Commonly-used notations 


notation 

provides 

example 

shorthand for 

1 

used to 

Tilde 

leading term 

~ 1 0 N 2 

10 N 2 

10 N 2 + 22 N log N 

10 N 2 + 2 N + 37 

provide 

approximate 

model 

Big Theta 

asymptotic 
growth rate 

0(N 2 ) 

14 N 2 

10 N 2 

5 N 2 + 22 N log N+ 3N 

classify 

algorithms 

Big Oh 

0(N 2 ) and smaller 

0(N 2 ) 

10 N 2 

100 N 

22 N log N + 3 N 

develop 
upper bounds 

Big Omega 

0(N 2 ) and larger 

n(N 2 ) 

14 N 2 

N 5 

N 2 + 22 N log N + 3 N 

develop 
lower bounds 


Common mistake. Interpreting big-Oh as an approximate model. 
This course. Focus on approximate models: use Tilde-notation 
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1 .4 Analysis of Algorithms 


► mathematical models 
rder-of-grc 



c icatk?ns 


► memory 



Basics 


Bit. 0 or 1 . 
Byte. 8 bits. 
Megabyte (MB). 
Gigabyte (GB). 


NIST most computer scientists 

1 1 

1 million or 2 20 bytes. 

1 billion or 2 30 bytes. 



64-bit machine. We assume a 64-bit machine with 8 byte pointers. 

• Can address more memory. 

• Pointers use more space. 

\ 

some JVMs "compress" ordinary object 
pointers to 4 bytes to avoid this cost 
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Typical memory usage for primitive types and arrays 


type 

bytes 

boolean 

1 

byte 

1 

char 

2 

int 

4 

float 

4 

long 

8 

double 

8 


for primitive types 


type 

bytes 

char [] 

2N + 24 

int[] 

4N + 24 

doublet] 

8N + 24 


for one-dimensional arrays 


type 

bytes 

char[] □ 

~2MN 

i nt t] t] 

~ 4 M N 

doublet] t] 

~ 8 M N 


for two-dimensional arrays 
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Typical memory usage for objects in Java 


Object overhead. 1 6 bytes. 

Reference. 8 bytes. 

Padding. Each object uses a multiple of 8 bytes. 


Ex 1 . A Date object uses 32 bytes of memory. 


public class Date 

{ 

private int day; 
private int month; 
private int year; 

■ ■ ■ 

} 


32 bytes 


object 

overhead 


day 


month 


year 


padding 



int 

values 


1 6 bytes (object overhead) 


4 bytes (int) 

4 bytes (int) 

4 bytes (int) 

4 bytes (padding) 
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Typical memory usage for objects in Java 


Object overhead. 1 6 bytes. 

Reference. 8 bytes. 

Padding. Each object uses a multiple of 8 bytes. 


Ex 2. A virgin Stri ng of length N uses ~ 2N bytes of memory. 


public class String 

{ 

private char[] value; 
private int offset; 
private int count; 
private int hash; 

■ ■ ■ 

} 


2N + 64 bytes 


object 

overhead 


val ue 


offset 


count 


hash 


padding 


reference 


int 

values 


1 6 bytes (object overhead) 


8 bytes (reference to array) 
2N + 24 bytes (char[] array) 

4 bytes (int) 

4 bytes (int) 

4 bytes (int) 

4 bytes (padding) 
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Typical memory usage summary 


Total memory usage for a data type value: 

• Primitive type: 4 bytes for int, 8 bytes for double, ... 

• Object reference: 8 bytes. 

• Array: 24 bytes + memory for each array entry. 

• Object: 1 6 bytes + memory for each instance variable 
+ 8 bytes if inner class (for pointer to enclosing class). 

• Padding: round up to multiple of 8 bytes. 


Shallow memory usage: Don't count referenced objects. 

Deep memory usage: If array entry or instance variable is a reference, 
add memory (recursively) for referenced object. 
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Example 


Q. How much memory does WeightedQuickUnionUF use as a function of TV? 
Use tilde notation to simplify your answer. 


public WeightedQui ckUnionUF(i nt N) 

{ 

i d = new i nt [N] ; 
sz = new i nt [N] ; 

for (int i =0; i < N; i++) id[i] = i; 
for (int i =0; i < N; i++) sz[i] = 1; 


public class WeightedQuickUnionUF <■ 

{ 

private int[] id; + 
private int[] sz; * 
private int count; * 


1 6 bytes 

(object overhead) 

8 + (4N + 24) each 
reference + int[] array 

4 bytes (int) 

4 bytes (padding) 


} 


A. 8TV+88- 8 TV bytes. 
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1 .4 Analysis of Algorithms 


► mathematical models 
rder-of-grc 



c icatk?ns 


► memory 



Turning the crank: summary 


Empirical analysis. 

• Execute program to perform experiments. 

• Assume power law and formulate a hypothesis for running time. 

• Model enables us to make predictions. 

Mathematical analysis. 

• Analyze algorithm to count frequency of operations. 

• Use tilde notation to simplify analysis. 

• Model enables us to explain behavior. 

Scientific method. 

• Mathematical model is independent of a particular system; 
applies to machines not yet built. 

• Empirical analysis is necessary to validate mathematical models 
and to make predictions. 
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